Extracting meta tags attribute using wget [migrated]
Posted
by
Amit
on Pro Webmasters
See other posts from Pro Webmasters
or by Amit
Published on 2011-06-26T09:35:32Z
Indexed on
2011/06/26
16:31 UTC
Read the original article
Hit count: 261
html
|meta-keywords
I have a file having some URLs per line. I need to extract the "keywords" present in the tags i.e. if there is meta tag for "keywords" then i want to get "content" value for it. Example: if the web-page has this meta-tag then for that URL i want "wikipedia,encyclopedia" to be extracted.
One approach is to download the web-page using "wget" and then parse it using some standard HTML parser.
I was wondering is there any better way to do this without downloading the entire web-page.
© Pro Webmasters or respective owner